Image processing apparatus and computer program for generating a file in which plural area images of a document image are disposed in different pages

ABSTRACT

An image processing apparatus includes a processor, and memory storing computer readable instructions, when executed by the processor, causing the apparatus to function as a document image acquiring unit configured to acquire document image data which is generated by optically reading a document and represents a document image, an area image generating unit configured to use the document image data to generate plural area image data representing plural area images that are included in the document image, and a file generating unit configured to generate a file including the plural area image data such that the plural area images are disposed in different pages.

CROSS-REFERENCE TO RELATED APPLICATION

This application claims priority from Japanese Patent Application No.2012-148085, filed on Jul. 1, 2012, the entire subject matter of whichis incorporated herein by reference.

TECHNICAL FIELD

Aspects of the present invention relate to a technique of using documentimage data representing document images to generate a file includingplural pages.

BACKGROUND

It is common to read sheets (documents) by a scanner and store documentimage data representing document images. In order to store documentimages in a form convenient for a user, various image processingtechniques of processing document image data have been proposed. Forexample, JP-A-2010-187056 describes a technique of using one documentimage data obtained by reading one page of a document having pluralimages printed thereon to generate plural image data representing theplural images, respectively.

SUMMARY

It is demanded to improve user convenience in storing a document imageincluding plural images.

An aspect of the present invention provides a technique of storing adocument image including plural images in a convenient form.

The present invention has been made in view of the above circumstancesand provides the following illustrative embodiment.

According to an illustrative embodiment, there is provided an imageprocessing apparatus comprising: a processor; and memory storingcomputer readable instructions, when executed by the processor, causingthe apparatus to function as: a document image acquiring unit configuredto acquire document image data which is generated by optically reading adocument and represents a document image; an area image generating unitconfigured to use the document image data to generate plural area imagedata representing plural area images that are included in the documentimage; and a file generating unit configured to generate a fileincluding the plural area image data such that the plural area imagesare disposed in different pages.

According to the above-mentioned configuration, it is possible to usethe document image data to generate a file in which the plural areaimages included in the document image are disposed on different pages.Therefore, it is possible to store the document image including theplural area images in a convenient form.

The present invention can be implemented in various forms such as amethod of implementing the function of the above-mentioned apparatus, acomputer program for implementing the function of the above-mentionedapparatus, and a recording medium having the computer program recordedthereon.

BRIEF DESCRIPTION OF THE DRAWINGS

The above and other aspects of the present invention will become moreapparent and more readily appreciated from the following description ofillustrative embodiments of the present invention taken in conjunctionwith the attached drawings, in which:

FIG. 1 is a block diagram illustrating a configuration of a documentstoring system 1000;

FIG. 2 is a flow chart illustrating a template image data generatingprocess;

FIGS. 3A to 3C are views illustrating examples of a template image TI;

FIGS. 4A and 4B are views illustrating use modes of a memo sheet MS;

FIG. 5 is a flow chart illustrating a division document generatingprocess;

FIGS. 6A to 6C are explanatory views illustrating the division documentgenerating process;

FIGS. 7A to 7C are explanatory views illustrating the division documentgenerating process;

FIG. 8 is a view illustrating an example of a management table MT;

FIG. 9 is a flow chart illustrating an area image data generatingprocess; and

FIGS. 10A and 10B are views illustrating generation of a PDF file.

DETAILED DESCRIPTION A. Illustrative Embodiment A-1. Configuration ofDocument Storing System 1000

FIG. 1 is a block diagram illustrating a configuration of a documentstoring system 1000. The document storing system 1000 includes an imageprocess server 200, a multi-function apparatus 300, a personal computer400, and a storage server 500. The multi-function apparatus 300 and thepersonal computer 400 are used by the same user, and are connected toeach other through a LAN 600 to be communicable with each other. Themulti-function apparatus 300 and the personal computer 400 are connectedto the Internet 700 through the LAN 600. Each of the image processserver 200 and the storage server 500 is connected to the Internet 700.

The multi-function apparatus 300 includes a CPU 310 for controlling theentire multi-function apparatus 300, a volatile storage device 330 suchas a RAM, a non-volatile storage device 340 such as a flash memory, acommunication unit 320 for connection with an external device (such as adevice to which the multi-function apparatus 300 can be connectedthrough a network, or an external storage device such as a USB memory),an operating unit 350 including various buttons, a display unit 360 suchas a liquid crystal panel, a printing unit 370 which is an ink-jet orlaser printer engine, and an image reading unit 380.

The image reading unit 380 is a scanner which uses a one-dimensionalimage sensor having plural photoelectric conversion elements such asCCDs or CMOSs lined up in row to optically read a document to generatedocument image data (scan data). The image reading unit 380 may include,for example, an automatic document feeder to sequentially read pluralpages of a document, thereby automatically generating plural documentimage data.

The non-volatile storage device 340 stores a computer program 341. Thecomputer program 341 can be recorded on a CD-ROM or the like to beprovided. The CPU 310 executes the program 341 to function as anapparatus control unit 312 for controlling the entire multi-functionapparatus 300. The apparatus control unit 312 includes a template imageacquiring unit 314, and a document image transmitting unit 316. Thesefunctional units perform processes to be described below.

The image process server 200 includes a CPU 210, a volatile storagedevice 230 such as a RAM, a non-volatile storage device 240 such as ahard disk, and a communication unit 220 including an interface forconnection with the Internet 700. The non-volatile storage device 240stores a computer program 241. The computer program 241 can be recordedon a CD-ROM or the like to be provided.

The CPU 210 executes the computer program 241 to function as a templateimage generating unit 50 and a document image processing unit 100. Thedocument image processing unit 100 includes a document image acquiringunit 110, an image information acquiring unit 120, a page numberacquiring unit 130, a generation order specifying unit 140, anunnecessity condition determining unit 150, an area image generatingunit 160, and a file generating unit 170. These functional units performprocesses to be described below.

The storage server 500 is a general computer, and includes a CPU 510which executes a server program (not shown) to provide a service tostore or save user's data. Specifically, a user can store document imagedata representing a document in the storage server 500 through themulti-function apparatus 300 and the image process server 200 as will bedescribed below.

The personal computer 400 includes a CPU 410 which executes a program tofunction as a driver (not shown) for controlling the multi-functionapparatus 300, or a client to access the image process server 200. Theuser can use the personal computer 400 to access to document image datastored in the storage server 500.

A-2. Operation of Document Storing System 1000

The operation of the document storing system 1000 will be described witha focus on a process which the image process server 200 performs.

A-2-1: Template Image Data Generating Process

FIG. 2 is a flow chart illustrating a template image data generatingprocess. The template image data generating process is a process ofgenerating template image data for generating a memo sheet. If the imageprocess server 200 receives a generation request to request generationof template image data, the template image data generating process isperformed by the template image generating unit 50 of the image processserver 200. The template image acquiring unit 314 of the multi-functionapparatus 300 transmits the generation request to the image processserver 200 according to an instruction of the user of the multi-functionapparatus 300. For example, the user's instruction is a printinstruction to cause the printing unit 370 of the multi-functionapparatus 300 to print a template image of a memo sheet.

If receiving the generation request from the multi-function apparatus300 in Step S110, in Step S120, the template image generating unit 50acquires setting information included in the received generationrequest. As shown in FIG. 2, the setting information includes layoutinformation designated by the user. The layout information includes anallocation number AN, a sheet size PS, and a ruled line type LT to bedescribed below. The sheet size PS is the size of sheet predetermined tobe used for printing a template image, and is any one of general sheetsizes for printing, such as A4, A3, B5, B4, and so on. Examples of theruled line type LT include ruled lines of writing sheet (horizontalruled lines), ruled lines of graph sheet or manuscript sheet (squareruled lines), and so on. All or part of the layout information may bepredetermined and fixed values. The setting information may includeother information such as keyword information, in addition to the layoutinformation.

In Step S130, the template image generating unit 50 uses the settinginformation including the layout information to determine the layout ofa template image and the contents of unit images 10 to be disposed inthe template image. In Step S140, the template image generating unit 50generates template image data having the unit image 10 representing thedetermined contents and disposed in the determined layout.

FIGS. 3A to 3C are views illustrating examples of a template image TIwhich is represented by template image data. FIGS. 4A and 4B are viewillustrating use modes of a memo sheet MS. FIG. 3A shows an example ofthe unit image 10. FIG. 3B shows a first type of template image TIAhaving an allocation number AN of 2, and FIG. 3C shows a second type oftemplate image TIB having an allocation number AN of 4. FIG. 4A shows ause mode of a first type of memo sheet MSA having the first type oftemplate image TIA printed thereon, and FIG. 4B shows a use mode of asecond type of memo sheet MSB having the second type of template imageTIB printed thereon. Incidentally, in a case of distinguishing specifictypes of template images TI and memo sheets MS, alphabets (for example,‘A’ or ‘B’) are further added to the ends of their reference symbols.

As shown in FIGS. 3B and 3C, a template image TI includes (AN-number of)unit images 10, the number of which corresponds to the allocation numberAN. In other words, in a case of printing the template image TI on asheet to generate a memo sheet MS, the allocation number AN is thenumber of unit images 10 to be printed on (one side of) one sheet. Thefirst type of template image TIA includes two unit images 10A and 10B,and the second type of template image TIB includes four unit images 10C,10D, 10E, and 10F. Incidentally, in a case of distinguishing specificunit images 10, alphabets (for example, ‘A’ or ‘B’) are further added tothe ends of their reference symbols. This is similarly applied to areasSA of the memo sheets MS to be described below.

The memo sheet MS having the template images TI printed thereon isfolded as shown in FIGS. 4A and 4B and is used in the folded state.Specifically, the first type of memo sheet MSA (FIG. 4A) is folded alonga first folding line which is a line connecting the middle points CP1and CP2 of two long sides of rectangular sheet, one time, and is used inthe folded state. The second type of memo sheet MSB (FIG. 4B) is foldedalong a second folding line which is a line connecting the middle pointsCP3 and CP4 of two long sides of the sheet folded one time (which wereshort sides before the folding), one more time, and is used in thefolded state. By the folding lines when the memo sheet MS has beenfolded, one face of the memo sheet MS is divided into plural areas SA.Specifically, one face of the first type of memo sheet MSA is dividedinto two areas SAA and SAB (FIG. 4A), and one face of the second type ofmemo sheet MSB is divided into four areas SAC, SAD, SAE, and SAF (FIG.4B).

It is assumed that each area SA of the memo sheet MS is used as a memoarea for one page. For example, in the first type of memo sheet MSA, thearea SAA is used as a memo area of a first page, and the area SAB isused as a memo area of a second page. In the second type of memo sheetMSB, the four areas SAC, SAD, SAE, and SAF are used as memo areas offirst to fourth pages, respectively.

The unit image 10 is printed to each of the areas SA of the memo sheetMS, one per area. That is, one unit image 10 is configured to beconvenient when used as a memo area of one page. Specifically, as shownin FIG. 3A, one unit image 10 includes a ruled line area MA whichincludes ruled lines RL of the type designated by the layoutinformation, a header area HA which is adjacent to the upper side of theruled line area MA, and a footer area FA which is adjacent to the lowerside of the ruled line area MA. The ruled line area MA is also called abody text area, and the header area HA and the footer area FA are alsocalled peripheral areas positioned at the periphery of the ruled linearea MA. The ruled line area MA is a substantially blank area whichincludes the ruled lines RL but does not include any substantial objectssuch as characters, figures, and drawings.

The header area HA includes a top marker 11, a page information area 12,a title area 13, and an unnecessity check box 14. The top marker 11 maybe disposed at the left corner of the header area HA (the upper leftcorner of the unit image 10), and have a shape including a part parallelto a horizontal direction, and a part parallel to a vertical direction(a shape obtained by rotating the character “L” 90 degrees clockwise).In the page information area 12, the page number of the unit image 10 inthe template image TI is written. The title area 13 is an area forallowing the user to write a title for each unit image 10. Theunnecessity check box 14 is an area for allowing the user to write acheck mark in a case where it is not necessary to store the unit image10.

The footer area FA includes plural keywords 15, mark areas 16 whichcorrespond to the plural keywords 15, respectively, and a bottom marker17. The keywords 15 are keywords which the user can associate with theunit image 10. The specific contents of the keywords 15 (such as ‘MEMO’,‘MINUTES’, ‘SEMINAR’, and ‘IDEA’) may be determined in advance, or maybe specified according to the keyword information which can be includedin the setting information designated by the user. The mark areas 16 areareas which the user can fill to specify all or some of the pluralkeywords 15. The bottom marker 17 may have a shape obtained by rotatingthe top marker 11 180 degrees clockwise. As can be seen from the above,even if the ruled line area MA is substantially blank, in other words,even if the contents of the ruled line area MA has not been determined,candidates of the keywords 15 to be associated are printed onto the memosheet MS having the template image TI printed thereon.

Each unit image 10 is disposed inside the template image TI such that itis convenient in a case of using a memo sheet MS having the templateimage TI printed thereon in a folded state as described with referenceto FIGS. 4A and 4B.

Specifically, as shown in FIG. 3B, two unit images 10A and 10B of thefirst type of template image TIA are disposed in areas corresponding tothe two areas SAA and SAB shown in FIG. 4A, respectively. The two unitimages 10A and 10B are disposed in the same direction inside the firsttype of template image TIA such that an upper side (an arrow in FIG. 4A)which is assumed in a case where the first template image is used in thefolded state and the upper sides (arrows in FIG. 3B) of the two unitimages 10A and 10B match with each other.

Also, as shown in FIG. 3C, four unit images 10C to 10F of the secondtype of template image TIB are disposed in areas corresponding to thefour areas SAC to SAF shown in FIG. 4B, respectively. Two unit images10C and 10F and two unit images 10D and 10E are disposed in oppositedirection to each other inside the second type of template image TIBsuch that an upper side (an arrow in FIG. 4B) which is assumed in a casewhere the second type of template image TIB is used in the folded stateand the upper sides of the four unit images 10C to 10F (arrows in FIG.3C) match with each other. In other words, the two unit images 10C and10F and the two unit images 10D and 10E are in opposite directions toeach other with a line inside the second type of template image TIBcorresponding to the above-mentioned first folding line and interposedbetween the two unit images 10C and 10F and the two unit images 10D and10E.

In FIGS. 3B and 3C, numerals (following “P-”) shown in the unit images10 show page numbers assigned to the corresponding unit images 10 insidethe template image TI. As can be seen from those numerals, the pagenumbers of the unit images 10 are assigned according to an order of usewhich is assumed in the above-mentioned folded state. As describedabove, each page number is shown in the page information area 12 (FIG.3A).

Also, the template image TI may include a QR code (registered trademark)18 as shown in FIGS. 3B and 3C. The QR code includes various informationusable in a division document generating process to be described below.

As can be appreciated from the above explanation, the sizes of unitimages included in one template image TI are the same each other. In acase where the sheet size is fixed, as the allocation number ANincreases, the sizes of unit images 10 decrease, and as the allocationnumber AN decreases, the sizes of unit images 10 increase. Specifically,in the case where the sheet size is fixed, if the allocation number ANdoubles, the area of each unit image 10 becomes nearly half. Also, in acase where the allocation number AN is fixed, that is, in a case wherethe number of unit images 10 to be included in one template image TI isfixed, as the sheet size increases, the size of each unit image 10increases, and as the sheet size decreases, the size of each unit image10 decreases. Specifically, if the sheet area doubles, the area of eachunit image 10 also doubles. Also, as can be appreciated from theexplanation on the folded states of FIGS. 4A and 4B, in the presentillustrative embodiment, the number of unit images 10 to be included inone template image TI (the allocation number AN) has a value of the Zpower of 2 (here, Z is a natural number). In other words, the allocationnumber AN has any one of 2, 4, 8, 16, 32, . . . .

If the template image data is generated in Step S140, in Step S150, thetemplate image generating unit 50 transmits the template image data tothe multi-function apparatus 300 in response to the generation requestfrom the multi-function apparatus 300, and terminates the process. Asthe transmitted template image data, data of various formats can beused, for example, print data subjected to a rasterizing process or ahalftone process may be used, and vector data such as image datadescribed in a page description language may also be used.

The apparatus control unit 312 of the multi-function apparatus 300 usesthe received template image data to print the template image TI on asheet to generate a memo sheet MS. The generated memo sheet MS can beused by the user.

A-2-2: Division Document Generating Process

FIG. 5 is a flow chart illustrating the division document generatingprocess. The document generating process is a process of generating afile for storing image data representing the contents of the memo sheetMS having various information (referred to as write information) written(handwritten) thereon by the user. The document generating process isperformed by the document image processing unit 100 of the image processserver 200 if the image process server 200 receives document image datafrom the multi-function apparatus 300. The image reading unit 380 of themulti-function apparatus 300 reads the written memo sheet MS to generatedocument image data. The document image data may be stored in onetransmission file to be transmitted to the image process server 200. Ina case where there are plural document image data to be transmitted, theplural document image data are disposed or arranged in a transmissionfile in an order of generation (an order of read). That is, the pluraldocument image data are transmitted such that the generation orderspecifying unit 140 of the image process server 200 can specify theorder of generation of the plural document image data.

FIGS. 6A-6C and 7A-7C are explanatory views illustrating the divisiondocument generating process. First, a document image SI which isrepresented by document image data will be described. FIG. 6A shows afirst type of document image SIA which is represented by a first type ofdocument image data generated by reading a first type of memo sheet MSA,and FIG. 7A shows a second type of document image SIB which isrepresented by a second type of document image data generated by readinga second type of memo sheet MSB. Incidentally, in a case ofdistinguishing document images SI of specific types, alphabets (forexample, ‘A’ or ‘B’) are further added to the ends of their referencesymbols. A document image SI includes plural area images 20corresponding to plural unit images 10 which is included in one templateimage TI printed on one memo sheet MS. Therefore, the number of areaimages 20 which are included in one document image SI is the same as thenumber of unit images 10 which are included in one template image TI,and is the Z power of 2 (here, Z is a natural number), specifically, anyone of 2, 4, 8, 16, 32, . . . . Also, the sizes of multiple area images20 which are included in one document image SI are almost the same, andare almost the same as the sizes of corresponding unit images 10.Specifically, the first type of document image SIA shown in FIGS. 6A-6Cincludes two area images 20A and 20B, and the second type of documentimage SIB shown in FIGS. 7A-7C includes four area images 20C to 20F.Incidentally, in a case of distinguishing specific area images 20,alphabets (for example, ‘A’ or ‘B’) are further added to the ends oftheir reference symbols. The area image 20 may include correspondingunit image 10 (see FIGS. 3A to 3C) and include write information 19(also referred to as write image) written on the memo sheet MS by theuser. The write information 19 can be included in the ruled line area MAof each unit image 10 (see FIG. 3A), and can also be included in theperipheral areas of each unit image 10 (the header area HA and thefooter area FA (see FIG. 3A)). Specifically, the write information 19written by the user includes characters written in the ruled line areaMA, main information such as figures (contents to be stored), andvarious sub-information (such as checking of the unnecessity check box14, filling of the mark areas 16, and writing in the title area 13)(FIG. 6A). Hereinafter, a case where three document image data each ofwhich represents a document image SI including four area images 20 areprocessed will be described as an example. In this case, twelve areaimages 20 are processed.

If the document image acquiring unit 110 receives one or more documentimage data from the multi-function apparatus 300 in Step S300, in StepS310, the document image processing unit 100 selects one document imagedata to be processed. In a case where there are plural document imagedata to be processed, the document image data are selected one by one inthe order of generation of the document image data. For example, in acase where the document image data are arranged in the order ofgeneration in a transmission file, the document image data are selectedone by one in the order of arrangement in the transmission file.

In Step S320, the generation order specifying unit 140 determines anumber for the selected document image data (a document image numberSN). Here, document image numbers SN are numbers to specify the documentimage data, and are attached in the order of generation. In the presentillustrative embodiment, since the document image data are selected inthe order of generation in Step S320, the document image numbers SN areattached in the order of selection in Step S320. In other words, thedocument image numbers SN make it possible to specify the order ofgeneration of the document image data.

In Step S330, the area image generating unit 160 detects markersincluded in the document image SI, and specifies the layout of thedocument image SI. Specifically, the image information acquiring unit120 analyzes the document image data to detect the top marker 11 andbottom marker 17 (see FIGS. 3A to 3C) of each unit image 10 included inthe document image SI. As the analyzing method to detect the markers,known pattern matching can be used. The area image generating unit 160specifies the number and positions of area images 20 (which are the sameas the number and positions of unit images 10) included in the documentimage SI, on the basis of positions where the markers 11 and 17 aredetected.

In Step S340, the page number acquiring unit 130 acquires a page number(a document image page number SPN) corresponding to each area image 20which is inside the document image SI and is represented by a documentimage data to be processed. Specifically, the page number acquiring unit130 analyzes the document image data to acquire the page number includedin the page information area 12 (see FIG. 3A) corresponding to each areaimage 20. The page number of each area image 20 may be included in theinformation embedded in the QR code 18 (see FIG. 3A) such thatcorrespondence to the corresponding area image 20 can be specified. Inthis case, the page number acquiring unit 130 may read the QR code 18 toacquire a page number corresponding to each area image 20.

In Step S350, the document image processing unit 100 determines whethereach area image 20 is a subject of skip. Specifically, the imageinformation acquiring unit 120 analyzes the document image data todetect whether there is write information (specifically, a check mark)written in an unnecessity check box 14 (see FIG. 3A) included in eacharea image 20. The unnecessity condition determining unit 150 determinesthat an area image 20 corresponding to a unnecessity check box 14 havingwrite information written therein is a subject of skip, and determinesthat an area image 20 corresponding to an unnecessity check box 14having no write information written therein is not a subject of skip. Anarea image 20 determined as a subject of skip is excluded from objectsto be stored in a PDF file to be generated later as will be describedbelow.

In Step S360, the image information acquiring unit 120 acquires akeyword associated with each area image 20 (an associated keyword KW).Specifically, the image information acquiring unit 120 analyzes thedocument image data to detect whether there are write information(specifically, filling) written in plural mark areas 16 (see FIG. 3A)included in each area image 20. The image information acquiring unit 120acquires keywords 15 (see FIG. 3A) corresponding to the mark areas 16having write information written therein, as the associated keywords KWassociated with corresponding area images 20. The contents of thekeywords 15 corresponding to the mark areas 16 are specified on thebasis of the predetermined correspondence between the positions of themark areas 16 and the contents of the keywords 15. In a case where thecorrespondence between the mark areas 16 and the keywords 15 is notrecognized by the image process server 200, the image informationacquiring unit 120 may perform a known character recognition process (anOCR process) on areas represented by the keywords 15 of image data whichare objects to acquire the contents of the keywords 15.

In Step S370, the document image processing unit 100 determines whetherall document image data received from the multi-function apparatus 300have been selected. In a case where there is any unselected documentimage data (NO in Step S370), the document image processing unit 100returns to Step S310 to select a new document image data, and repeatsthe processes of Steps S320 to S360. Meanwhile, in a case where alldocument image data have been selected (YES in Step S370), the documentimage processing unit 100 proceeds to Step S380.

Also, various information acquired or specified until Step S370 arerecorded in the management table MT temporarily generated in thevolatile storage device 230. FIG. 8 is a view illustrating an example ofthe management table MT. In the management table MT, for each of thetwelve area images 20 processed, the document image number SN, thedocument image page number SPN, skip information SK, a PDF file numberPN, a PDF page number PPN, and the associated keyword KW are recorded.As described above, the document image number SN is specified in StepS320, and the document image page number SPN is acquired in Step S340.The skip information SK is information representing the result of thedetermination on whether each area image 20 is a subject of skipperformed in Step S350. In the management table MT of FIG. 8, withrespect to an area image 20 which is a subject of skip, as the skipinformation SK, information representing that the corresponding areaimage is a subject of skip is recorded, and with respect to an areaimage 20 which is not a subject of skip, any information is notrecorded. As described above, the associated keyword KW is acquired inStep S360. The PDF file number PN and the PDF page number PPN have notbeen recorded until the process of Step S370 is terminated.

In Step S380, the file generating unit 170 determines a specification ofa PDF file to be generated. Specifically, the file generating unit 170determines a PDF file number PN and a PDF page number PPN for each areaimage 20 specified by the combination of a document image number SN anda document image page number SPN, on the basis of the information havingbeen recorded in the management table MT (FIG. 8). Hereinafter, adescription (SN-SPN) may be used to represent an area image 20 specifiedby the combination of a document image number SN and a document imagepage number SPN of the area image 20. For example, an area image 20specified by the combination of a document image number SN of 1 and adocument image page number SPN of 2 may be represented by an area image20 of (1-2).

Specifically, PDF file numbers PN and the PDF page numbers PPN aredetermined using the order of arrangement of the twelve area images 20in the management table MT, and the associated keyword KW of the twelvearea images 20. As can be seen from FIG. 8, each of the twelve areaimages 20 is earlier in the order of arrangement as the correspondingarea image 20 is earlier in the order of generation represented by thedocument image numbers SN, and in four area images 20 of the samedocument image SI, each area image 20 is earlier in the order ofarrangement as the document image page number SPN of the correspondingarea image 20 is smaller. First, an area image 20 determined as asubject of skip is excluded from the process, and with respect to thecorresponding area image 20, any PDF file number PN and any PDF pagenumber PPN are not attached. For two or more area images 20 (an areaimage group) which are consecutive in the order of arrangement of them,and have the same associated keyword KW, the same PDF file number PN isattached. Also, for two or more area images 20 having differentassociated keywords KW, different PDF file numbers PN are attached. In acase where there are two area images 20 having the same associatedkeyword KW, and an area image 20 having a different associated keywordKW is included in area images 20 positioned between the two area images20 in the order of arrangement, different PDF file numbers PN areattached to the two area images 20. In the example shown in FIG. 8, fivearea images 20 of (1-1) to (2-1) are consecutive in the order ofarrangement, and their associated keywords KW are the same as ‘MINUTES’.Therefore, the PDF file numbers PN of the five area images 20 aredetermined to 1. Two area images 20 of (2-2) and (2-3) are consecutivein the order of arrangement, and their associated keywords KW are thesame as ‘MEMO’. Therefore, the PDF file numbers PN of the two areaimages 20 are determined to 2. Four area images 20 from (2-4) to (3-3)are consecutive in the order of arrangement, and their associatedkeywords KW are the same as ‘IDEA’. Therefore, the PDF file numbers PNof the four area images 20 are determined to 3.

Also, the file generating unit 170 attaches PDF page numbers PPNbeginning with 1 to one or more area images 20 having the same PDF filenumber PN, in the order of arrangement. Specifically, as shown in FIG.8, to five area images 20 having PDF file numbers PN of 1, PDF pagenumbers PPN of 1 to 5 are attached in the order of arrangement. To twoarea images 20 having PDF file numbers PN of 2, PDF page numbers PPN of1 and 2 are attached in the order of arrangement. To four area images 20having PDF file numbers PN of 3, PDF page numbers PPN of 1 to 4 areattached in the order of arrangement.

In Step S390, the area image generating unit 160 performs an area imagedata generating process on the three document image data to generatearea image data representing the twelve area images 20 included in thethree document images SI. However, an area image data representing anarea image 20 of (3-4) determined as a subject of skip may not begenerated.

FIG. 9 is a flow chart illustrating the area image data generatingprocess. In Step S391, the area image generating unit 160 selects onedocument image data to be processed. In Step S392, the area imagegenerating unit 160 performs a tilt correcting process on the selecteddocument image data to correct the tilt of a document image SIrepresented by the document image data. The tilt correcting processincludes a process of computing a tilt angle based on the coordinates ofthe markers 11 and 17 included in the document image SI, and a rotatingprocess of rotating the document image SI by the computed tilt angle. Aspecific computing method may be performed using a known technique.

In Step S393, the area image generating unit 160 uses the document imagedata subjected to the tilt correcting process to specify a divisionposition in the document image SI. For example, the division positionmay be specified on the basis of the coordinates of the markers 11 and17 included in the document image SI. In a case where the document imageSI is a first type of document image SIA including two area images 20 asshown in FIG. 6B, a division position is a straight line connecting twomiddle points dividing two long sides of the first type of documentimage SIA almost equally, respectively. Also, in a case where thedocument image SI is a second type of document image SIB including fourarea images 20C to 20F as shown in FIG. 7B, division positions are astraight line connecting two middle points dividing two long sides ofthe second type of document image SIB almost equally, respectively, anda straight line connecting two middle points dividing two short sides ofthe second type of document image SIB almost equally, respectively.

In Step S394, the area image generating unit 160 divides the documentimage SI at the specified division position, and generates area imagedata representing individual area images 20 included in the documentimage SI. In the case where the document image SI is a first type ofdocument image SIA including two area images 20 as shown in FIG. 6B, twoarea image data representing the two area images 20A and 20B aregenerated. Also, in the case where the document image SI is a secondtype of document image SIB including four area images 20C to 20F asshown in FIG. 7B, four area image data representing the four area images20C to 20F are generated.

In Step S395, the area image generating unit 160 specifies an area image20 to be rotated, according to the layout of the document image SI.Specifically, in a case where the document image SI is the first type ofdocument image SIA shown in FIGS. 6A-6C, all area images, that is, thetwo area images 20A and 20B are specified as area images to be rotated.In a case where the document image SI is the second type of documentimage SIB shown in FIGS. 7A-7C, some area images, that is, two areaimages 20D and 20E having document image page numbers SPN of 2 and 3 arespecified as area images to be rotated.

In Step S396, the area image generating unit 160 performs the rotatingprocess of rotating the specified area image 20 by an amount of rotationaccording to the layout, on the area image data representing thespecified area image 20. Specifically, in the case where the documentimage SI is the first type of document image SIA shown in FIGS. 6A-6C,the two area images 20A and 20B are rotated 90 degrees clockwise. Also,if the reading direction of the image reading unit 380 during generatingof document image data is reversed, a first type of document image SIAmay be in an opposite direction to the state shown in FIG. 6B. In thiscase, the area image generating unit 160 needs only to rotate the twoarea images 20A and 20B 90 degrees counterclockwise. As a result, thetwo area image data are converted to represent the two area images 20Aand 20B in a state where the area images are oriented in appropriatedirections. A state where an image is oriented in an appropriatedirection may be a state where the image is oriented in a direction inwhich a user feels that the image should be oriented upward as seeingthe image. For example, in a case of a character image, a state wherethe image is oriented in an appropriate direction is a state where theupper sides of characters in the character image are oriented upward.For example, in a case of a unit image 10 included in an area image 20,a state where a direction shown by a broken line arrow in FIG. 3A isoriented upward is a state where the unit image is oriented in anappropriate direction.

Similarly, in a case where the document image SI is the second type ofdocument image SIB shown in FIGS. 7A-7C, the two area images 20D and 20Eare rotated 180 degrees. As a result, the four area images 20C to 20Fare oriented in the same direction, and the four area image data areconverted such that the four area images 20C to 20F are oriented inappropriate directions. Also, if the reading direction of the imagereading unit 380 during generating of document image data is reversed, asecond type of document image SIB may be in an opposite direction to thestate shown in FIG. 7B. In other words, in the document image SIB, thearea images 20C and 20F corresponding to page numbers of 1 and 4 may bepositioned on the upper side in FIGS. 7A-7C, and the area images 20D and20E corresponding to page numbers of 2 and 3 may be positioned on thelower side in FIGS. 7A-7C. In this case, the area image generating unit160 needs only to rotate the two area images 20C and 20F 180 degrees.

In Step S397, the area image generating unit 160 determines whether alldocument image data have been selected. In a case where there is anyunselected document image data (NO in Step S397), the area imagegenerating unit 160 returns to Step S391 to select a new document imagedata, and repeat the processes of Steps S392 to S396. Meanwhile, in acase where all document image data have been selected (YES in StepS397), the area image generating unit 160 terminates the area image datagenerating process.

If the area image data generating process terminates, in Step S400 ofFIG. 5, the file generating unit 170 uses the plural generated areaimage data to generate a file in a portable document format (PDF) (a PDFfile). The PDF file is a file having the concept of pages, and can storeone or more image data in association with an arbitrary page. In the PDFfile, with respect to an image represented by each image data associatedwith a page, it is possible to designate the layout position on thepage, a layout angle, and so on. The user can use a browsing program(viewer) to browse images represented by the image data stored in thePDF file like plural pages of sheets, on a display.

FIGS. 10A and 10B are views illustrating generation of a PDF file.Specifically, the file generating unit 170 generates three PDF filesPFA, PFB, and PFC according to the PDF file number PN and the PDF pagenumber PPN recorded for each area image 20 in the management table MTshown in FIG. 8. In the PDF file PFA, five area image data having PDFfile numbers PN of 1 are stored. Five area images 20 represented by thefive area image data stored in the PDF file PFA are disposed in pages ofcorresponding PDF page numbers PPN. Similarly, in the PDF file PFB, twoarea image data having PDF file numbers PN of 2 are stored, and the twoarea image data are disposed in pages of corresponding PDF page numbersPPN. In the PDF file PFC, four area image data having PDF file numbersPN of 3 are stored, and the four area image data are disposed in pagesof corresponding PDF page numbers PPN. Here, an area image datacorresponding to one area image 20 determined as a subject of skip isnot stored in the generated PDF files.

As a result, among the twelve area images 20 included in an image groupSIG (FIGS. 10A and 10B) composed of the three document images SI, elevenarea images 20 except for one area image 20 determined as a subject ofskip are classified into and stored in the three PDF files PFA, PFB, andPFC (FIG. 10B).

In Step S410 of FIG. 5, the document image processing unit 100 transmitsthe three PDF files PFA, PFB, and PFC to a storage destination, andterminates the division document generating process. In the presentillustrative embodiment, the storage destination is the storage server500. The document image processing unit 100 associates useridentification information (specifically, a user ID for using thestorage server 500) with the three PDF files PFA, PFB, and PFC, andtransmits them to the storage server 500. The CPU 510 of the storageserver 500 stores the three PDF files PFA, PFB, and PFC in a storageunit (not shown) of the storage server 500. The user can use a clientdevice such as the personal computer 400 or a portable terminal (notshown) to access to the PDF files PFA, PFB, and PFC stored in thestorage server 500.

According to the present illustrative embodiment described above, it ispossible to generate PDF files including plural area images 20 of adocument image SI disposed in different pages. Therefore, it is possibleto store the document image SI including the plural area images 20 in aconvenient form.

Also, the page number acquiring unit 130 acquires plural page numberinformation (specifically, numerals in page information areas 12 (FIGS.3A to 3C)) corresponding to plural area image data by image analysis.Then, the file generating unit 170 generates PDF files including thearea image data disposed in the acquired page number order (FIGS. 8 and10). Therefore, it is possible to generate convenient PDF filesincluding the area image data disposed in the page number order.

Further, in the present illustrative embodiment, the document imageprocessing unit 100 generates N-number of (eleven in the examples ofFIGS. 8 and 10) area image data, and generates M-number of (three in theexamples of FIGS. 8 and 10) PDF files each of which includes one or morearea image data of the N-number of area image data. At least one file(three PDF files in the examples of FIGS. 8 and 10) of the M-number ofPDF files includes two or more area images 20 corresponding to two ormore area image data of the N-number of area image data and disposed indifferent pages. That is, it is possible to use document image data togenerate two or more PDF files. As a result, it is possible to generatea convenient number of PDF files according to document image data. Also,it is preferable that N should be an integer equal to or greater than 3,and it is preferable that M should be an integer from 2 to (N−1). It canbe seen from FIG. 8 that two area images 20 corresponding to two areaimage data included in the PDF file PFB are area images havingconsecutive page numbers (document image page numbers SPN) in thedocument image SI.

The image information acquiring unit 120 analyzes the document imagedata to acquire plural associated keywords KW corresponding to theplural area image data. The document image processing unit 100 can usethe plural associated keywords KW to generate the PDF file PFA includingplural area image data corresponding to plural area images 20 havingassociated keywords KW of ‘MINUTES’, and the PDF file PFB includingplural area image data corresponding to plural area images 20 havingassociated keywords KW of ‘MEMO’ (FIGS. 8 and 10). That is, it ispossible to use specific type information such as an associated keywordKW included in each area image 20 to generate plural PDF files.Therefore, it is possible to store a document image SI in a moreconvenient form.

The document image acquiring unit 110 acquires two ore more documentimage data (for example, document image data having document imagenumbers SN of 2 (FIG. 8)) including first document image datarepresenting first document images (for example, document image datahaving document image numbers SN of 1 (FIG. 8)) and second documentimage data representing second document images. Then, the area imagegenerating unit 160 uses the first document image data to generateP-number of (for example, four (FIG. 8)) first area image data (forexample, four area image data which are generated by dividing documentimage data having document image numbers SN of 1) representing P-numberof area images 20 included in the first document images, respectively.The area image generating unit 160 uses the second document image datato generate P-number of second area image data (for example, four areaimage data which are generated by dividing document image data havingdocument image numbers SN of 2) representing P-number of area images 20included in the second document images, respectively.

Also, the file generating unit 170 generates files (for example, thethree PDF files PFA, PFB, and PFC (FIGS. 10A and 10B)) which include theP-number of first area image data of the first document images and theP-number of second area image data of the second document imagesdisposed in 2×P-number of different pages. As a result, it is possibleto generate files including 2×P-number of area image data included intwo or more document image data. In this case, the 2×P-number of areaimages 20 are disposed in the PDF files in the page order determined onthe basis of first page number information, second page numberinformation, and the order of generation of the first document imagedata and the second document image data (the order of document imagenumbers SN). Therefore, it is possible to store plural document imagesSI in a convenient form according to the order of generation and thepage numbers.

A document image SI including four area images 20 includes first areaimages disposed in opposite directions to each other (for example, thearea images 20C and 20F having page numbers of 1 and 4 (FIG. 7)), andsecond area images (for example, the area images 20D and 20E having pagenumbers of 2 and 3 (FIG. 7)). According to the present illustrativeembodiment, the file generating unit 170 generates PDF files such thatthe first area images and the second area images are oriented in thesame direction. As a result, it is possible to generate files capable ofconspicuously displaying document images including first area images andsecond area images disposed in opposite directions to each other in adocument image SI.

Also, the unnecessity condition determining unit 150 determines whethereach of plural area images satisfies a condition to become a subject ofskip, on the basis of existence or non-existence of a check mark in anunnecessity check box 14 included in each area image 20. In a case wherethere is an area image 20 satisfying the skip condition, the filegenerating unit 170 generates PDF files which do not include an areaimage data representing the area image 20 satisfying the skip condition.As a result, in a case of displaying the contents of the PDF files, theunnecessary image is not displayed, and thus convenience is furtherimproved.

B. Modification

(1) In the above-mentioned illustrative embodiment, a page number isprinted in the page information area 12. Alternatively, the pageinformation area 12 may be printed as a blank, and the user may write apage number therein. In this case, the user can easily designate a page,so that convenience can be improved.

(2) In the above-mentioned illustrative embodiment, as the specific typeinformation, the associated keywords KW are used to determine the numberof PDF files to be generated, and area image data to be stored in eachPDF file. The specific type information is not limited to the associatedkeywords KW, but may be a title written in the title area 13. Here, asshown in Modification (1), in a case where the user can write a pagenumber, as the specific type information, the page number written by theuser may be used. For example, in a case where four page numbers writtenin four page information areas 12 of four area images 20 included in asecond type of memo sheet MSB are 1, 2, 1, and 2, respectively, and apair of 1 and 2 is written in red, and the other pair of 1 and 2 iswritten in blue, the file generating unit 170 may generate a filecorresponding to the page numbers written in red, and a filecorresponding to the page numbers written in blue.

(3) In the above-mentioned illustrative embodiment, the file generatingunit 170 does not store an area image data representing an area image 20satisfying the skip condition, in PDF files. Alternatively, the filegenerating unit 170 may store an area image data representing an areaimage 20 satisfying the skip condition, in a PDF file, and storeinformation (comment) representing that the area image 20 satisfying theskip condition will not be disposed (displayed) in any page, in theheader of the PDF file. Even in this case, in a case of displaying thecontents of the PDF file, the unnecessary image is not displayed. Also,the skip condition may be whether the page information area 12 isfilled, whether an “X” mark larger than a reference is written in theruled line area MA, or whether any information is not actually writtenin the ruled line area MA by the user. Generally, it is preferable thatthe unnecessity condition determining unit 150 determine whether each ofplural area images satisfies a specific skip condition, on the basis ofwrite information of a specific area included in each of plural areaimages, and in a case where a specific area image satisfying thespecific skip condition is included, the file generating unit 170generate files such that the specific area image will not be displayed.

(4) In Step S396 of the above-mentioned illustrative embodiment, thearea image generating unit 160 performs the rotating process of rotatinga specific area image 20 by an amount of rotation (angle) according tothe layout. Alternatively, the file generating unit 170 may store anarea image data representing the specific area image 20, in a PDF file,without performing the rotating process, and store informationrepresenting that the specific area image 20 will be rotated by theamount of rotation according to the layout and be displayed in therotated state, in the header of the PDF file. Even in this case, thefile generating unit 170 can generate a PDF file such that all areaimages represented by plural area image data in the PDF file aredisplayed in the same direction.

(5) The file generating unit 170 may generate files having other fileformats, in place of PDF files. Other file formats have the concept ofpages, and may include file formats in which it is possible to disposeplural image data in a plural different pages, for example, a XML PaperSpecification (XPS) format.

(6) The image reading unit for generating document image data needs onlyto be a device for optically reading a document. Therefore, the imagereading unit is not limited to the scanner of the illustrativeembodiment, but may be a digital camera.

(7) In the above-mentioned illustrative embodiment, the imageinformation acquiring unit 120 analyzes document image data representinga document image SI to perform acquisition of various information, suchas detection of the markers 11 and 17, and detection of existence ornon-existence of write information into the mark areas 16 and theunnecessity check box 14. Alternatively, the image information acquiringunit 120 may analyze area image data generated by dividing the documentimage SI by the area image generating unit 160 to acquire the samevarious information. The image information acquiring unit 120 mayanalyze at least one of the document image data and the area image datato acquire the same various information.

(8) The function of the document image processing unit 100 of the imageprocess server 200 in the above-mentioned illustrative embodiment may beprovided in the multi-function apparatus 300. In this case, themulti-function apparatus 300 uses the function of the document imageprocessing unit 100 which the multi-function apparatus 300 has, toperform the above-mentioned process on the document image data which theimage reading unit 380 has generated in response to an instruction ofthe user to generate a PDF file. Also, the personal computer 400 mayhave the function of the document image processing unit 100. The PDFfile storage destination is not limited to the storage server 500, butmay be a storage device included in the case of the personal computer400 or the multi-function apparatus 300, a memory card such as a USBmemory connected to the personal computer 400 or the multi-functionapparatus 300, or the image process server 200.

(9) The image process server 200 is not limited to a device housed inone casing, but may be a device configured by plural computers housed inindividual casings (so-called cloud server).

(10) A part of a configuration implemented by hardware in theabove-mentioned illustrative embodiment may be replaced with software,and in contrast, a part of a configuration implemented by software inthe above-mentioned illustrative embodiment may be replaced withhardware.

While the present invention has been shown and described with referenceto certain illustrative embodiments thereof, it will be understood bythose skilled in the art that various changes in form and details may bemade therein without departing from the spirit and scope of theinvention as defined by the appended claims.

What is claimed is:
 1. An image processing apparatus comprising: aprocessor; and memory storing computer readable instructions, whenexecuted by the processor, causing the apparatus to function as: adocument image acquiring unit configured to acquire document image datawhich is generated by optically reading a document and represents adocument image; an area image generating unit configured to use thedocument image data to generate plural area image data representingplural area images that are included in the document image; and a filegenerating unit configured to generate a file including the plural areaimage data such that the plural area images are disposed in differentpages, wherein the area image generating unit is configured to use onedocument image data to generate N-number of area image data, where N isan integer equal to or greater than 3, wherein the file generating unitis configured to generate M-number of files each of which includes oneor more of the N-number of area image data, where M is an integer from 2to (N−1), and wherein at least one of the M-number of files is a file inwhich two or more area images corresponding to two or more area imagedata of the N-number of area image data are disposed in different pages.2. The image processing apparatus according to claim 1, wherein thecomputer readable instructions, when executed by the processor, causingthe apparatus to further function as: a page number acquiring unitconfigured to acquire plural page number information corresponding tothe plural area image data by analyzing at least one of the documentimage data and the plural area image data, and wherein the filegenerating unit is configured to use the plural page number informationto generate the file including the plural area images disposed in anorder of page numbers.
 3. The image processing apparatus according toclaim 1, wherein computer readable instructions, when executed by theprocessor, causing the apparatus to function as: a specific typeacquiring unit configured to acquire plural specific type informationcorresponding to the plural area image data by analyzing at least one ofthe document image data and the plural area image data, wherein the filegenerating unit is configured to use the plural specific typeinformation to generate the M-number of files comprising a first fileincluding two or more area image data of a first type and a second fileincluding one or more area image data of a second type, wherein each oftwo or more first-type area images represented by the two or more areaimage data of the first type includes first information as the specifictype information, and wherein each of one or more second-type areaimages represented by the one or more area image data of the second typeincludes second information as the specific type information.
 4. Theimage processing apparatus according to claim 1, wherein the documentimage includes a first area image and a second area image oriented indifferent directions, and wherein the file generating unit is configuredto generate the file such that the first area image and the second areaimage are displayed in a same direction.
 5. The image processingapparatus according to claim 1, wherein the document image includeswrite information in the document, wherein computer readableinstructions, when executed by the processor, causing the apparatus tofurther function as: a condition determining unit configured todetermine whether each of the plural area images satisfies a specificcondition based on the write information in a specific area that isincluded in a corresponding area image, and wherein if the plural areaimages includes a specific area image satisfying the specific condition,the file generating unit is configured to generate the file such thatthe specific area image is not displayed.
 6. An image processingapparatus comprising: a processor; and memory storing computer readableinstructions, when executed by the processor, causing the apparatus tofunction as: a document image acquiring unit configured to acquiredocument image data which is generated by optically reading a documentand represents a document image; an area image generating unitconfigured to use the document image data to generate plural area imagedata representing plural area images that are included in the documentimage; and a file generating unit configured to generate a fileincluding the plural area image data such that the plural area imagesare disposed in different pages, wherein the document image acquiringunit is configured to acquire two or more document image data includingfirst document image data representing a first document image, andsecond document image data representing a second document image, whereinthe area image generating unit is configured to use the first documentimage data to generate P-number of first area image data representingP-number of area images that are included in the first document image,and configured to use the second document image data to generateP-number of second area image data representing P-number of area imagesthat are included in the second document image, and wherein the filegenerating unit is configured to generate a file including the P-numberof first area image data and the P-number of second area image data suchthat the P-number of area images included in the first document imagesand the P-number of area images included in the second document imagesare disposed in 2×P-number of different pages.
 7. The image processingapparatus according to claim 6, wherein the computer readableinstructions, when executed by the processor, causing the apparatus tofurther function as: a page number acquiring nit configured to acquireP-number of first page number information corresponding to the P-numberof first area image data by analyzing at least one of the first documentimage data and the P-number of first area image data, and configured toacquire P-number of second page number information corresponding to theP-number of second area image data by analyzing at least one of thesecond document image data and the P-number of second area image data;and a generation order specifying unit configured to specify an order ofgeneration of the first document image data and the second documentimage data, and wherein the file generating unit is configured togenerate a file including the P-number of area images included in thefirst document image and the P-number of area images included in thesecond document image which are disposed in an order of pages determinedbased on the first page number information, the second page numberinformation, and the order of generation of the first document imagedata and the second document image data.
 8. The image processingapparatus according to claim 6, wherein the computer readableinstructions, when executed by the processor, causing the apparatus tofurther function as: a page number acquiring unit configured to acquireplural page number information corresponding to the plural area imagedata by analyzing at least one of the document image data and the pluralarea image data, and wherein the file generating unit is configured touse the plural page number information to generate the file includingthe plural area images disposed in an order of page numbers.
 9. Theimage processing apparatus according to claim 6, wherein the documentimage includes a first area image and a second area image oriented indifferent directions, and wherein the file generating unit is configuredto generate the file such that the first area image and the second areaimage are displayed in a same direction.
 10. The image processingapparatus according to claim 6, wherein the document image includeswrite information in the document, wherein computer readableinstructions, when executed by the processor, causing the apparatus tofurther function as: a condition determining unit configured todetermine whether each of the plural area images satisfies a specificcondition based on the write information in a specific area that isincluded in a corresponding area image, and wherein if the plural areaimages includes a specific area image satisfying the specific condition,the file generating unit is configured to generate the file such thatthe specific area image is not displayed.
 11. A non-transitorycomputer-readable medium having a computer program stored thereon andreadable by a computer, the computer program, when executed by thecomputer, causing the computer to functions as: a document imageacquiring unit configured to acquire document image data which isgenerated by optically reading a document and represents a documentimage; an area image generating unit configured to use the documentimage data to generate plural area image data representing plural areaimages that are included in the document image; and a file generatingunit configured to generate a file including the plural area image datasuch that the plural area images are disposed in different pages,wherein the area image generating unit is configured to use one documentimage data to generate N-number of area image data, where N is aninteger equal to or greater than 3, wherein the file generating unit isconfigured to generate M-number of files each of which includes one ormore of the N-number of area image data, where M is an integer from 2 to(N−1), and wherein at least one of the M-number of files is a file inwhich two or more area images corresponding to two or more area imagedata of the N-number of area image data are disposed in different pages.12. The non-transitory computer-readable medium according to claim 11,wherein computer readable instructions, when executed by the processor,causing the apparatus to function as: a specific type acquiring unitconfigured to acquire plural specific type information corresponding tothe plural area image data by analyzing at least one of the documentimage data and the plural area image data, wherein the file generatingunit is configured to use the plural specific type information togenerate the M-number of files comprising a first file including two ormore area image data of a first type and a second file including one ormore area image data of a second type, wherein each of two or morefirst-type area images represented by the two or more area image data ofthe first type includes first information as the specific typeinformation, and wherein each of one or more second-type area imagesrepresented by the one or more area image data of the second typeincludes second information as the specific type information.
 13. Anon-transitory computer-readable medium having a computer program storedthereon and readable by a computer, the computer program, when executedby the computer, causing the computer to functions as: a document imageacquiring unit configured to acquire document image data which isgenerated by optically reading a document and represents a documentimage; an area image generating unit configured to use the documentimage data to generate plural area image data representing plural areaimages that are included in the document image; and a file generatingunit configured to generate a file including the plural area image datasuch that the plural area images are disposed in different pages,wherein the document image acquiring unit is configured to acquire twoor more document image data including first document image datarepresenting a first document image, and second document image datarepresenting a second document image, wherein the area image generatingunit is configured to use the first document image data to generateP-number of first area image data representing P-number of area imagesthat are included in the first document image, and configured to use thesecond document image data to generate P-number of second area imagedata representing P-number of area images that are included in thesecond document image, and wherein the file generating unit isconfigured to generate a file including the P-number of first area imagedata and the P-number of second area image data such that the P-numberof area images included in the first document images and the P-number ofarea images included in the second document images are disposed in2×P-number of different pages.
 14. The non-transitory computer-readablemedium according to claim 13, wherein the computer readableinstructions, when executed by the processor, causing the apparatus tofurther function as: a page number acquiring nit configured to acquireP-number of first page number information corresponding to the P-numberof first area image data by analyzing at least one of the first documentimage data and the P-number of first area image data, and configured toacquire P-number of second page number information corresponding to theP-number of second area image data by analyzing at least one of thesecond document image data and the P-number of second area image data;and a generation order specifying unit configured to specify an order ofgeneration of the first document image data and the second documentimage data, and wherein the file generating unit is configured togenerate a file including the P-number of area images included in thefirst document image and the P-number of area images included in thesecond document image which are disposed in an order of pages determinedbased on the first page number information, the second page numberinformation, and the order of generation of the first document imagedata and the second document image data.